376 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English Portuguese Spanish french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> sentences Production Status:
<Not Specified>
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Parallel Corpora for the Biomedical Domain
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Aurélie Névéol | LIMSI-CNRS | FR |
| Author 2 | Antonio Jimeno Yepes | IBM | AU |
| Author 3 | Mariana Neves | Hasso Plattner Institute | DE |
| Author 4 | Karin Verspoor | The University of Melbourne | AU |
| Main Contact | Aurélie Névéol | LIMSI-CNRS | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
American English German Modern Greek Portuguese Spanish
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike 3.0 License and GNU Free Documentation License.
Size:
<Not Specified> <Not Specified>Production Status:
Existing-updated
Use:
Semantic Web
-
Paper title:DBpedia: A Multilingual Cross-domain Knowledge Base
-
Paper track:General issues
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Pablo Mendes | <Not Specified> | None |
| Author 2 | Max Jakob | <Not Specified> | None |
| Author 3 | Christian Bizer | <Not Specified> | None |
| Main Contact | Pablo Mendes | Freie Universität Berlin | DE |
Documentation:
http://dbpedia.org/Datasets/NLPLanguage Type:
Multilingual
Languages:
English German Portuguese Spanish italian
Availability:
Freely Available
License:
CreativeCommons
Size:
<Not Specified> Production Status:
Newly created-finished
Use:
Word Sense Disambiguation
-
Paper title:Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources
-
Paper track:Terminology
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Spandana Gella | University of Melbourne | AU |
| Author 2 | Carlo Strapparava | FBK-irst | IT |
| Author 3 | Vivi Nastase | FBK | DE |
| Main Contact | Spandana Gella | University of Melbourne | None |
Documentation:
In Progress
Written
Ontology,
Language Type:
Multilingual
Languages:
English German Spanish french italian
Availability:
Freely Available
License:
CreativeCommons
Size:
1500000 concepts Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion
-
Paper track:NLP engineering experiment paper
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Naoki Otani | Carnegie Mellon University | US |
| Author 2 | Hirokazu Kiyomaru | Kyoto University | N/A |
| Author 3 | Daisuke Kawahara | Kyoto University | JP |
| Author 4 | Sadao Kurohashi | Kyoto University | JP |
| Main Contact | Naoki Otani | Carnegie Mellon University | None |
Documentation:
English documentation is publicly available: https://github.com/commonsense/conceptnet5/wikiLanguage Type:
Multilingual
Languages:
Basque German Spanish french italian
Availability:
From Data Center(s)
License:
META-SHARE and/or CC
Size:
1040 Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Arantza del Pozo | Vicomtech-IK4 | ES |
| Author 2 | Carlo Aliprandi | Synthema | IT |
| Author 3 | Aitor Álvarez | Vicomtech-IK4 | ES |
| Author 4 | Carlos Mendes | VoiceInteraction | PT |
| Author 5 | Joao P. Neto | VoiceInteraction | PT |
| Author 6 | Sérgio Paulo | VoiceInteraction | PT |
| Author 7 | Nicola Piccinini | Synthema | IT |
| Author 8 | Matteo Raffaelli | Synthema | IT |
| Main Contact | Arantza del Pozo | Vicomtech-IK4 | None |
Documentation:
Not available yet
Written
Corpus,
Language Type:
Multilingual
Languages:
American English German Spanish french italian
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Extending HeidelTime for Temporal Expressions Referring to Historic Dates
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jannik Strötgen | Heidelberg University | DE | Max-Planck-Institut für Informatik | DE |
| Author 2 | Thomas Bögel | Institute of Computer Science, Heidelberg University | DE | ||
| Author 3 | Julian Zell | Heidelberg University | DE | ||
| Author 4 | Ayser Armiti | Heidelberg University | DE | ||
| Author 5 | Tran Van Canh | Heidelberg University | DE | ||
| Author 6 | Michael Gertz | Heidelberg University | DE | ||
| Main Contact | Jannik Strötgen | Max-Planck-Institut für Informatik | None | Bosch Center for Artificial Intelligence | None |
Documentation:
http://code.google.com/p/heideltime/
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Spanish Swedish
Availability:
Freely Available
License:
CreativeCommons (CC BY-SA 3.0)
Size:
90 GByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet Format
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marcus Klang | Lund University | SE |
| Author 2 | Pierre Nugues | Lund University | SE |
| Main Contact | Pierre Nugues | Lund University | None |
Documentation:
http://semantica.cs.lth.se/wikiparq/Language Type:
Multilingual
Languages:
Dutch English Spanish french italian
Availability:
Freely Available
License:
OpenSource
Size:
5 * 5000 Production Status:
Newly created-finished
Use:
Sentiment Analysis
-
Paper title:Generating Polarity Lexicons with WordNet propagation in 5 languages
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Isa Maks | VU University Amsterdam | NL | ||
| Author 2 | Ruben Izquierdo | VU University | NL | ||
| Author 3 | Francesca Frontini | PRAXILING - Université Paul-Valéry Montpellier 3 | FR | ILC CNR - Pisa Italy | FR |
| Author 4 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES | ||
| Author 5 | Piek Vossen | VU University Amsterdam | NL | ||
| Author 6 | Andoni Azpeitia | vicomtech | ES | ||
| Main Contact | Isa Maks | VU University Amsterdam | None |
Documentation:
yes
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech Romanian Slovak Spanish Vietnamese
Availability:
Freely Available
License:
<Not Specified>
Size:
55 GByte Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Diacritics Restoration Using Neural Networks
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jakub Náplava | Charles University, Institute of Formal and Applied Linguistics | CZ | ||
| Author 2 | Milan Straka | Charles University | None | ||
| Author 3 | Pavel Straňák | Charles University in Prague | CZ | ||
| Author 4 | Jan Hajic | Charles University in Prague | CZ | Charles University | CZ |
| Main Contact | Jakub Náplava | Charles University, Institute of Formal and Applied Linguistics | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Basque Catalan Galician Portuguese Spanish
Availability:
From Owner
License:
Part of the data will be distributed through LDC (not yet)
Size:
23.8 GByte Production Status:
Newly created-finished
Use:
Language Identification
-
Paper title:KALAKA-3: a database for the recognition of spoken European languages on YouTube audios
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Luis Javier Rodriguez-Fuentes | University of the Basque Country UPV/EHU | ES | ||
| Author 2 | Mikel Penagarikano | University of the Basque Country | ES | ||
| Author 3 | Amparo Varona | University of the Basque Country | ES | ||
| Author 4 | Mireia Diez | University of the Basque Country | CZ | ||
| Author 5 | German Bordel | University of the Basque Country | None | University of the Basque Country | ES |
| Main Contact | Luis Javier Rodriguez-Fuentes | University of the Basque Country UPV/EHU | None |
Documentation:
Albayzin 2012 Language Recognition Evaluation Plan (http://iberspeech2012.ii.uam.es/images/PDFs/albayzin_lre12_evalplan_v1.3_springer.pdf)




